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ABSTRACT 

The Canadian Cluster Comparison Project is a comprehensive multi-wavelength sur- 
vey targeting 50 massive X-ray selected clusters of galaxies to examine baryonic tracers 
of cluster mass and to probe the cluster-to-cluster variation in the thermal properties 
of the hot intracluster medium. In this paper we present the weak lensing masses, 
based on the analysis of deep wide-field imaging data obtained using the Canada- 
France-Hawaii- Telescope. The final sample includes two additional clusters that were 
located in the field-of-view. We take these masses as our reference for the compari- 
son of cluster properties at other wavelengths. In this paper we limit the comparison 
to published measurements of the Sunyaev-Zel'dovich effect. We hnd that this signal 
correlates well with the projected lensing mass, with an intrinsic scatter of 12 ± 5% at 
~ r25oo, demonstrating it is an excellent proxy for cluster mass. 
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1 INTRODUCTION 

Observational constraints on the various constituents that 
make up the universe have tightened tremendously in the 
past decade. However, many puzzles remain, most notably 
the origin of the accelerated expansion o f the universe (e.g., 
iRiess et al.1ll998l ; |Perlmutter etalll 19991 ). Another challenge 
is to explain the transition from a smooth early universe, 
as indicated by observations of the cosmic microwave back- 
ground, to today's highly structured and complex universe. 

Arguably, clusters of galaxies provide one of the most 
important pieces of this fascinating cosmic puzzle. Galaxy 
clusters are one of the primary reservoirs of baryons in the lo- 
cal Universe that can be studied using observations spanning 
the full electromagnetic spectrum. As such, they provide an 
excellent laboratory for studying the physics affecting the 
evolution of baryons, at least over the past 10 Gyrs. Finally, 
the number density of clusters is a sensitive function of key 
cosmological parame ters, including th e dark energy equa- 
tion of state (see e.g.. lAllen et alj|201ll . for a recent review). 
Hence, there is an active ongoing effort to use clusters as 
precision probes of these parameters. 



* Based on observations from the Canada-France-Hawaii Tele- 
scope, which is operated by the National Research Council of 
Canada, le Centre National de la Recherche Scientifique and the 
University of Hawaii. 



The use of clusters as cosmological probes and the study 
of the baryon physics to explain their observable properties 
are closely connected. The former requires cluster catalogs 
with well-defined selection functions that somehow need to 
be related to predictions. The predictions are based on nu- 
merical simulations of cold dark matter. As clusters are dis- 
covered through their optical, X-ray or radio properties, it 
is imperative that we understand the relation between the 
observables and the underlying dark matter distribution. 

Cluster samples are increasing rapidly thanks to op- 
tical surveys (e.g., iGladders fc Yee 20051; Koester et all 
| 2007f). X-ray stud ies (e.g., iReiprich fc Bohringerl 1200a 
Ebeli ng et all 120101) and Sunyaev-Zel'dovic h surveys (e.g., 



Williamson et alj|201ll ; | Marriage et al 1 l201lh . The detection 



significance correlates with mass and can thus be used as a 
mass-proxy, which nonetheless needs to be calibrated. This 
can be done using deep X-ray observations, under the as- 
sumption that the intra-cluster medium (ICM) is in hy- 
drostatic equilibrium. There is, however, both theoretical 
and observational evidence that that X-ray masses tend to 
be biased low if one assumes hydrostatic eq uilibrium (e.g., 
iNagai et al. I l2007l ; iMahdavi et~alll2008l . 120121 ). 

To compare the baryonic properties of clusters to the 
results from numerical simulations, we instead need a di- 
rect probe of the (dark) matter distribution. Such a probe 
exists in the form of weak gravitational lensing: the gravi- 
tational potential of the cluster perturbs the paths of pho- 
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tons emitted by distant galaxies. As a result, the images 
of the galaxies appear slightly distorted. The amplitude of 
the distortion provides us with a direct measurement of the 
gravitational tidal field, which in turn can be used to map 
the distribu tion of dark matter al ong the line-of-sight di- 
rectly (e.g., iKaiser fc Squiresi 1 19931 ). However, comparison 
with other mass traces does require one to make assump- 
tions about the geometry of the cluster, because weak lens- 
ing measures the total mass projected along the line-of-sight. 

Weak gravitational lensing is now a well-established 
technique to study the distribution of matter in the uni- 
verse. The applications range f r om the study of galaxy 
halos (e.g., iHoekstra et al.ll2004l ; iMandelbaum et al.l 12009 ; 
Ivan Uitert et alj|201ll) to the study of la r ge-scale structure 
(e.g. jHoekstra et al.ll2006l : iFu et ai.ll200sl : ISchrabback et all 



2010). 



Several developments in the past decade have also led 
to improvements in the weak lensing studies of galaxy clus- 
ters. For instance, the galaxy shapes are not only affected 
by gravitational lensing, but observational distortions can 
cause systematic signals that are similar in size. However, 
extensive tests have sho wn that techniques can now reach an 
accuracy of 1-2% (e.g iHevmans et al.|l200d; iMassev et al.l 



120071 ; iBridle et al.ll201ol ; iKitching et al.ll2012l ). Finally, the 



source redshift distributions, which are required to convert 
the lensing signal into a physical mass, are now much better 
known, compared to even a few years ago. 

It is possible to derive mass estimates by fitting para- 



data extend to small radii 



e.g Smail et al. 


1997l;lDahle et all 


:IJee et al.ll2011 


). This is also rel- 



evant for the study of high redshift (2 > 0.6) clusters, which 
can only be studied reliably using Hubble Space Telescope 
observations. At these high redshifts lensing masses are par- 
ticularly important, because the clusters are expected to be 
dynamically young. Fortunately the number of high redshift 
clusters for which weak lensing masses have been dete r mined 
has increased in recent year. For instance |jee et all (|201ll ) 
analysed a sample of 22 z > 0.9 clusters and found evidence 
for an evolution in the normalization of the relation between 
mass and Tx, the X-ray temperature. Clusters of galaxies, 
however, may show significant substructure or are not well 
described by the adopted model, whic h leads to biases in the 
mass estimates l|Hoekstra et al. 2002). As a consequence, it 
is important to measure the lensing signal out to large radii, 
which allows for more direct mass measurements. 

This is now possible thanks to wide-field image rs on 
world-class telescopes. For instance, iHoekstral (|2007f) pre- 
sented masses for a sample of 20 X-ray luminous clusters of 
galaxies that were observed using the CFH12k camera on the 
Canada- France-Hawaii Telescope (CFHT). A s part of the 
Local C luster Substructure Survey (XoCuSS). lOkabe et alJ 
(2010a) presented results for a sample of 30 clusters with 
0.15 < z < 0.3 using Subaru data. As part of a weak lensing 
follow-up program of c luster s discovered by the South Pole 
Telescope, iHigh et all i|2012l) presented results for an initial 
sample of 5 clusters observed with the Megacam imager on 
the Magellan telescope. 

To fully exploit the statistical power of cluster sur- 
veys, it is timely to increase the sample of clusters for 
which accurate weak lensing masses are available. I n thi s 
paper we update the results presented in iHoekstr 1 (120071 ) 
and augment the sample with 30 clusters with redshifts 
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Figure 1. X-ray luminosity within rsoo in the 2 — 10 keV band 
(top) and temperature within r2soo (middle) as a function of clus- 
ter redshift. The distribution of redshifts is shown in the bottom 
panel. 



0.15 < z < 0.55 that were observed using MegaCam on 
CFHT. This sample forms the basis for the Canadian Clus- 
ter Comparison Project (CCCP), which is a comprehensive 
multi- wavelength study of these massive clusters of galaxies. 

The structure of the paper is as follows. In §2 we present 
the data and discuss the data analysis. The weak lensing 
analysis is discussed in §3 and the mass measurements are 
presented in §4. In §5 we compare our results to measure- 
ments of the Sunyaev-Zel'dovich Effect. Throughout the pa- 
per we assume a cosmology with £l m — 0.3, Qa = 0.7 and 
H = 70ft 70 km/s/Mpc. 



2 CLUSTER SAMPLE AND OPTICAL DATA 

The main objective of the Canadian Cluster Comparison 
Project (CCCP) is to study the different baryonic tracers 
of cluster mass and to explore insights about the thermal 
properties of the hot diffuse gas and the dynamical states of 
the clusters that can be gained from cluster-to-cluster varia- 
tions in these relationships. An important aspect is to com- 
pare to accurate weak lensing masses for the clusters, which 
requires deep data with good image quality over a wide field- 
of-view, for which we use the Canada-France-Hawaii Tele- 
scope (CFHT). 

The starting point of the project is the sample of 20 
clusters, for which deep archival B and R band observations 
with the CFH12k camera were available (t he first 20 entrie s 
in in Table [H). This sample was studied in IHoekstral l|2007h ; 
iMahdavi et al.l (|2008l ). Nearly half of these clusters were 
originally observed by the Canadian Network for Observa- 
tiona l Cosmology(CNOCl; lYee et al.l fl996l ; ICarlberg et"aH 
Il996l ) and comprise the brightest clusters in the Einstein 
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Observatory Exte nded Medium Sensitivity Survey (EMSS; 
iGioia et al.il 1990l ). 

To improve the statistics, this initial sample was aug- 
mented with an additional 30 clusters (listed in Table [1]) 
that were observed using Megacam on CFHT in the g' and 
r'-band. To ensure a significant detection of the lensing sig- 
nal, clusters with an ASCA temperature of ksTx > 5 keV, 
and redshi fts 0.15 < z < 0.55 were selected based on the re- 
sults from lHorneil (l200ll). N ote that 4 of the EMSS clusters 
studied by IHoekstral (|2007l ) do not meet the X-ray tempera- 
ture criteriorQ Finally Table [1] lists two additional clusters 
(Abell 222 and Abell 1234) that were located in the observed 
fields. 

In this paper we therefore present weak lensing masses 
for 52 clusters of galaxies. The imaging data have been used 
in a number of related studies. For instance, the surface 
brigh tness profiles of the b rightest cluster galaxies were stud- 
ied in lBildfell et~al1 l|200St ). The CCCP sample also provided 
the high redshift subse t in a study of the evolution of the 
dwarf-to-giant ratio bv lBildfell et al.l |2012). 

The X-ra y ana lysis is described in detail in 
iMahdavi et al.l l|2012h and the resulting X-ray lumi- 
nosity and temperature as a function of redshift are 
presented in Figure Q] (top and middle panel, resp.), as well 
as the distribution of cluster redshifts (bottom panel). The 
sample is dominated by clusters at z ~ 0.2, with a tail 
out to z ~ 0.5. This mostly reflects the limitations of the 
input ASCA catalog. It is important to note, however, that 
no selection was made based on the optical properties or 
the dynamical state of the cluster. Interestingly, al t hough 
the sample is by no means complete, iMahdavi et al.l (|2012T ) 
find that the X-ray properties appear to be representative. 
The statistics of the Lx — Tx relation of the CCCP sample 
are indistinguishable from samples wi th well-characterized 
selection functio ns, such as MACS llEbeling et al.l l201Ch 
and HIFLUGCS l|Reiprich fc Bohringdl2002h . 

For the clusters observed with Megacam and z < 0.3 
the observations typically consist of four 400s exposures in 
g' and eight 600s exposures in r' . For the higher redshift 
clusters we obtained four 600s exposure in g' and twelve 
600s exposures in r' . These integrations times enable us to 
use galaxies down to r' = 25 in our weak lensing analy- 
sis. We only use the r' data because they are deeper and 
have better image quality. The g data are used to identify 
the cluster early-type galaxies, which lie on a well-defined 
color-magnitude relation. The exposure times and informa- 
tion about the data reduction of the clusters observed using 
the CFH12k ca mera in the B and R filters can be found in 
iHoekstral (|2007T ). 



2.1 Processing of optical imaging data 

Current wide field imaging instruments such as MegaCam 
consist of a mosaic of chips, and special care needs to be 
taken to account for sudden jumps in the PSF properties 
when data from different exposures are combined. To avoid 
such problems altogether, the CCCP data are obtained in 
two sets of exposures. Each set of four (or six) exposures 



1 These are MS1224.7+2007, MS1231. 3+1542, MS1455.0+2232 
and MS1512.4+3647. 



is taken with small dithers. The two sets are offset by ap- 
proximately half a chip in each direction to fill in most of 
the gaps between chips. Rather than combining all our data 
into a single deep frame, for our weak lensing analysis we 
consider each set separately and combine the measurements 
in the catalog stage. 

The data were detrended using the Elixir pipeline de- 
veloped at CFHT. The pipeline also provides photometric 
zeropoints and in most cases a reasonable first order as- 
trometric solution. We verified that the data were indeed 
taken during photometric conditions. The astrometric solu- 
tion provided by Elixir is not sufficiently accurate to com- 
bine the exposures into deeper images without affecting the 
PSF: errors in the astrometry lead to additional anisotropies 
in the images. Instead , we follow the procedure described in 
iHoekstra et al.l (|2006l ) to refine the astrometric solution. We 
use the USNO-A2 catalog to calibrate the red images from 
the Digital Sky Survey (POSS II), which in turn is used 
to generate a catalog of sources with accurate astrometric 
positions. 

This new astrometric catalog is matched to each of the 
MegaCam images. Each set of exposures is then processed 
separately. We detect objects in each of the exposures and 
combine matched objects into a master catalog, which con- 
tains the average positions of the matched objects. This mas- 
ter catalog is used to derive the final second order astromet- 
ric solution for each chip. This step ensures that the objects 
in each set of exposures are accurately matched to the same 
position. 



3 WEAK LENSING ANALYSIS 

We briefly review the steps in the weak len sing analy- 
sis, but refer the reader to IHoekstral (120071 ) for a de- 
tailed discussion of the analysis and various issues that 
arise when interpreting the data. The first step is to de- 
tect the faint galaxies and identify the stars. The stars are 
used to correct the galaxy shapes for the effects of PSF 
anisotro py and seeing . To do so, we use the procedur e deve l- 
oped by iKaiser et all i|l995l ) and iLuppino fc Kaiser] l|l997l) . 
Modifications to the ori ginal approach are d escribed in 
IHoekstra et al.l l|l99ct ) and lHoekstra et al.l l|2000r ) . The same 
analysis pipeline was also used f o r the cosmic shear anal- 
ysis presented in IHoekstra et all (|2006l ). Furthermore, the 
method also performed well in the Shear Te sting Programme 
l|Hevmans et al.l [20061 ; iMassev et al.1 120071 ). which demon- 
strated that we can recover the weak lensing signal with 
an accuracy of ~ 2%. 

As mentioned earlier, each set of stacked images is anal- 
ysed separately. Because the images were taken with small 
offsets, we can analyse the data on a chip-by-chip basis. We 
thus obtain two catalogs for each cluster of galaxies. The two 
catalogs are merged and the measurements for the objects 
that appear in both are combined. The resulting catalog is 
used to derive the cluster masses presented below. However, 
the redundancy in the data allow us also to check for con- 
sistency of the lensing signal. 

To quantify the lensing signal we consider the az- 
imuthally averaged tangential shear (jt) as a function of 
distance r from the cluster centre (for which we use the 
positions listed in Table [T] also see £|3 . 1 p> - To minimize the 



© 0000 RAS, MNRAS 000, 000-000 



4 Hoekstra et al. 

Table 1. Basic information of the Canadian Cluster Comparison Project sample 
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U.4 ( 


QQ 

oo 


Abell 


2111 


u.zzy 


1 ch oom Ar\ cs 

io oy 4u.o 


1 0/1°0C_/07 // 
+o4 10 It 


1 OQ 

lUo 


on oc 
ZU-ZO 




n C7 
U.O / 


o oo 
U.oo 


Oi) 


Abell 


2163 


o ooo 
U.ZUo 


1 h i cm /i n os 

io io 4y.u 


— UO Uo 41 


1 1 
1 1 


on oc 
ZU-ZO 




n fir\ 
U.OU 


O Qfi 

U.oO 


1 o 
4U 


Abell 


2204 


o i co 
U. loZ 


i «h oom a v ns 
10 oZ 4/.U 


i nco Q/i/qo// 
+UO o4 v5o 


A 

L 


on oc 
ZU-ZO 




n fiQ 
U.Oo 


O A fi 

U.40 


41 


Abell 


2259 


0.164 


iT'hoomoo 7s 


1 070 A r\l C\Qff 

+ZY 4U Uo 


7C 

/o 


20-25 




0.65 


O A A 

U.44 


42 


Abell 


2261 


0.224 


17 h 22 m 27.2 s 


+32°07 / 58 // 


5 


20-25 




0.57 


0.33 


43 


Abell 


2537 


0.295 


23 h 08 m 22.2 s 


-02°ll / 32 // 


8 


21-25 




0.49 


0.24 


44 


MS0440.5+0204 


0.190 


04 h 43 m 09.9 s 


+02°10 / 19 // 


4 


20-25 




0.60 


0.37 


45 


MS0451. 6-0305 


0.550 


04 h 54 m 10.8 s 


-03°00'51 // 


34 


22-25 




0.28 


0.08 


46 


MS1008.1-1224 


0.301 


10 h 10 m 32.3 s 


-12°39 / 53" 


12 


21-25 




0.47 


0.23 


47 


RXJ1347.5-1145 


0.451 


13 h 47 m 30.1 s 


-ll°45 / 09 // 


7 


22-25 




0.34 


0.12 


48 


RXJ1524.6+0957 


0.516 


15 h 24 m 41.6 s 


+09° 59' 34" 


44 


22-25 




0.30 


0.09 


49 


MACS J0717.5+3745 


0.548 


07 h 17 m 30.4 s 


+37°45 / 38 // 


113 


22-25 




0.27 


0.07 


50 


MACS J0913.7+4056 


0.442 


09 h 13 m 45.5 s 


+40°56 / 29 // 


1 


22-25 




0.36 


0.13 


51 


CIZA J1938+54 


0.260 


19 h 38 m 18.1 s 


+54° 09' 40" 




21-25 




0.52 


0.27 


52 


3C295 


0.460 


14 h ll m 20.6 s 


+52°12'10" 


5 


22-25 




0.34 


0.12 



Column 2: cluster name; Column 3: cluster rcdshift; Column 4,5: right ascension and declination (J2000.0) of the adopted 
cluster center. In all but three cases (Abell 520, Abell 851 and Abcll 1914) we take this to be the position of the brightest 
cluster galaxy (BCG). Column 6: the offset between the peak in the X-ray emission and the position of the BCG. For the 
values marked a we actually use the X-ray position as the cluster center. Note that not all clusters have modern X-ray data; 
Column 7: magnitude range used for the source galaxies. For clusters 1 — 20 this is the Rq filter and r' for the remaining 
clusters; Column 8,9: the average values of /3 = Di 3 /D a and (/3 2 ) (as explained in the text) 
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effect of the gaps between the chips we oniy consider the 
signal between 0.5 — 2/if 1 Mpc and fit a singular isothermal 
sphere model: 

(7T> = g, (1) 

where te is the Einstein radius. Under the assumption of 
isotropic orbits and spherical symmetry, the Einstein radius 
(in radians) is related to the line-of-sight velocity dispersion 
a through 

r E =^(^(l3), (2) 

where {/}) = (Di 3 /D s ) is the average ratio of the angular 
diameter distances between the lens and the source, and the 
observer and the source. The value of (j3) depends on the 
source redshift distribution, and is discussed in detail in the 
following section. For our comparison we do need to account 
for small variations in the actual source redshift distribution 
because the two images may have different seeing. This is 
particularly true for pointings that were observed a year 
apart. We therefore follow the procedure described below 
and also determine average values for j3 for each pointing. 

Figure [2] shows the results of our comparison: the inde- 
pendent measurements of the lensing signal for each cluster 
agree very well. For the difference in te between the two po- 
sitions we find (Ar E ) = 0.07 ±0.37. The reduced \ 2 = 0.51, 
which is too small if the two measurements would have been 
uncorrelated. However, the uncertainty in te is determined 
by the error in the shape measurements, which is a combi- 
nation of the noise in the image and the intrinsic shapes of 
the source galaxies. The latter will cause the measurements 
to be correlated and we need to account for this in the com- 
parison. We find that due to gaps between the chips, masks, 
etc. ~ 70% of the objects appear in both catalogs within the 
aperture we use to fit the model. We estimate the expected 
variation in the difference in rg by simulating the effect of 
image noise and shape noise from objects that appear in 
only one of the catalogs. We find that the random error is 
approximately 70% of the total (formal) error and we ex- 
pect a reduced % 2 ~ 0.5, which is in good agreement with 
the actual value. This suggests that the results obtained for 
the two positions are indeed in good agreement with one 
another. 



3.1 Choice of cluster center 

To quantify the lensing signal we need to define the clus- 
ter center. If the adopted center is offset from the 'true' 
center the tangential shear is lowered and the inferred 
mass is also biased low. Hence, one option is to deter- 
mine the location that maximizes the lensing signal, for 
insta nce by reconstructing t he projected mass distribution 
fe.g., lKaiser fc Sauiredfl993l ), or fitting a model to the data. 
Such an approach, however, will lead to masses that are bi- 
ased high. Instead we (typically) choose the location of the 
brightest cluster galaxy (BCG) to define the cluster center. 

The adopted centers are listed in Table[T] The projected 
offset between the location of the BCG and the pe ak of the 
X-ray emission determined by Ma hdavi et al.l (|2012T ) are also 
listed. The distribution of offsets is presented in Figure [3] 
With the exception of the four clusters discussed below, the 




10 20 

r po S1 [..] 



Figure 2. top panel: Plot of the best fit Einstein radii for the two 
sets of exposures. We have accounted for the small differences in 
the value for /3 for the two sets, bottom panel: Residuals from the 
one-to-one relation between Einstein radii. The lensing signals 
inferred from both data sets agree very well and the observed 
scatter agrees well with the expected variation. 

offsets are less than ~ lOOkpc, resulting in negligible biases 
in the cluster masses (see Figure [3] and discussion in i )4.2l) . 
In three cases we use the position of the peak of the X-ray 
emission. Abell 520 is a well known mer ging system, and pos- 
sibly the result of a three-way merger (jMahdavi et al.ll2007l : 
lOkabe fc Umetsull2008l : Ijee et al.l2012h . The unrelaxed clus- 
ter Abell 851 lacks a well defined brightest cluster galaxy 
and the X-ray emission peaks away from the concentration 
of bright cluster members. The BCG in Abell 1914 also 
shows a large offset from the peak in t he X-ray emission . 
It is believed to b e an ongoing merg er Jgahle et all [2002: 
IJones et al.l 120051 : lOkabe fc Umets ul 120081), sh owing high 
brightnes s tidal features rtFeldmeier et al.l 120041) and a ra - 
dio halo (|Giovannini et al.lll999l : iKempner fc Sarazi 3 1200 j ). 
Abell 1758 is als o a merging system with two peaks in the 
X-ray emission (|David fc Kempnerl 120041 : IRagozzine et al.l 
I2012T ). One peak is well centred on one of the BCGs, whereas 
the other component is clearly offset from the other BCG, 
which we nonetheless adopt as the center of the other clus- 
ter. 

3.2 Source redshift distribution 

To relate the lensing signal to physical quantities, such as 
mass, requires knowledge of the redshifts of the faint source 
galaxies. The higher the source redshift, the higher the lens- 
ing signal for a given mass. The redshifts need not be known 
with high precision, and photometric redshifts are sufficient. 
We lack the color information to derive photometric red- 
shifts for our sources, but fortunately it suffices to assume 
an average redshift distribution that can be obtained from 
other data sets. This does lead to an increase in the uncer- 



© 0000 RAS, MNRAS 000, 000-000 



6 Hoekstra et al. 



tainty wi th which the cluster m ass can be determined. As 
shown in iHoekstra et al.l (|2011al ') this contribution is small 
for clusters with z ~ 0.2, and only becomes relevant when 
studying high redshift clusters (z > 0.6). 

We the r efore can use the photometric redshifts from 
lllbert et af] (|2006h which are based on the CFHT Legacy 
Survey Deep fields and well matched in depth to what we 
need. Importantly, the survey covers a sufficiently large area 
of sky (4 fields of one square degree each) that sample vari- 
ance, which has plagued earlier studies, is no longer impor- 
tant. 

The use of photometric redshifts naturally accounts for 
galaxies in front of the cluster, which do not contribute to 
the lensing signal. However, the regions around clusters show 
large enhancements of galaxies (after all, that is what clus- 
ters of galaxies are) and many of those will end up in our 
catalog of sources. Our data only allow us to remove galax- 
ies on the red-sequence. For our choice of filters, this only 
leads to a modest reduction in the level of contamination, 
because most faint cluster members are in fact blue. 

The level of contamination after removing galaxies on 
the red -sequence depends o n the choice of filter. For in- 
stance, lOkabe et al.l (|2010ah find that their V and i' data 
allows for a relatively clean selection of sources. As discussed 
below, compared to our Megacam observations, th e contam- 
inatio n is also lower for the clusters studied in IHoekstral 
|2007l) . which are based on B and Rc data. Hen ce our com- 
binati on of g' and r' is not ideal. As shown in Hi gh et al.l 
l|2012i ). an efficient way to remove cluster members for our 
sample would be to obtain a dditional i ' data . We lack such 
data and therefore we follow IHoekstral (|2007l ) and derive an 
average correction for each cluster, by measuring the excess 
of source galaxies as a function of radius. We assume that 
the excess declines as f cg (r) oc r _1 , which is a reasonable 
description of the observed excess counts (c.f. Figure 3 in 
IHoekstral [2007h . Note that we do exclude the galaxies on 
the red-sequence. To correct for the contamination by clus- 
ter members, the observed tangential shear is then scaled by 
a factor 1 + f cg (r). 

The correction leads to a ~ 13% increase in the best 
fit Einstein radii for the Megacam data, but only a ~ 4% 
increase for the CFH12k data. For the aperture masses the 
correction depends on the overdensity. We find that on aver- 
age the results for M2500 are boosted by ~ 18 and ~ 6% for 
the Megacam and CFH12k data, respectively. The contam- 
ination is less important for M500, which is based on mea- 
surements at larger radii: the masses based on the Megacam 
data are on average increased by ~ 8%, whereas the masses 
of the other clusters are boosted by only ~ 3%. 



4 MASS DETERMINATION 

One of the advantages of weak gravitational lensing is that 
it provides a direct measure of the projected mass distribu- 
tion, without having to make assumptions about the den- 
sity profile. In practice, however, the conversion of the ob- 
servations into useful mass estimates does depend on the 
density profile, although the dependence can be weak. Fur- 
thermore, projected masses cannot be compared directly to 
results from other techniques, such as X-ray observations. 
Alternatively one can assume a parametric form for the den- 



sity distribution and fit the predicted tangential distortion 
to the data. We discuss and compare the results from both 
approaches in the following sections. 



4.1 Parametric mass model 

Numerical simulations of cold dark matter indicate that the 
density profiles of dark matter halos are well described b y 
the fitting function proposed bv lNavarro et all (|l996l , ll997f ). 
The profile is characterized by two parameters: the mass of 
the halo and the concentration c (or characteristic scale). A 
commonly used method to infer cluster masses is to com- 
pare this model to the observed lensing signal and to deter- 
mine the best fit values for the mass and the concentration, 
although c is typically poorly constrained. In simulations, 
however, the mass and the concentration are found to be 
correlated, albeit with an intrinsic scatter, which reflects 
the variati on in halo forma tion histories. We follow the def- 
initions of IHoekstral (|2007h . but use a more recent relation 
between the virial mass M v j r and co ncentration c, which 
was determined bv lDuffv et alj (120081 ) based on the cosmo- 
logical parameters th at best fit the WMAP5 observations 
iKomatsu et al.ll2009l '): 



M NFW 

c=7.85' Vlr 



2 x 10 12 



(1 + *)- 



(3) 



This relation is used when we fit the NFW model to the 
lensing signal at radii 0.5 — 2^7 1 Mpc. The resulting values 
for AC FW are presented in Table [2] For reference, we also 
list the values for M^q^ and , which are the masses 

within the radii where the mean mass density of the halo 
is respectively 2500 and 500 times the critical density at 
the redshift of the cluster, computed from the best fit virial 
mass (and Eqn. [3). We also fit a SIS model to the shear at 
these radii. The resulting Einstein radii and corresponding 
velocity dispersions are also listed in Table [2] 

For the model fit, we avo i d both the small and large 
radii. As discussed in IHoekstral d200ll . 120031 ) : IHoekstra et alj 
l|2011bh : iBecker fc Kravtsovl l|201lf ) structures along the 
line-of-sight contribute noise to the mass measurement. This 
'cosmic noise' dominates at large radii, where the cluster sig- 
nal is small. We account for the noise introduced by large- 
scale structure in the error budget. We do not fit the small 
scales in order to reduce (residual) contamination by clus- 
ter members and the effects of substructure (i.e., deviations 
from the simple NFW profile). Generally, the presence of 
substructure in the cluster core causes a redu ction in the tan- 
genti al shear, which biases the masses low (Hoekstra et al. 
2002). As discussed in £13. II the choice of cluster center is 
also important, as an offset from the true centre biases the 
mass low as well (also see Fig. [3} ■ The bias is smaller if the 
offsets are small relative t o the scales on which t he shear is 
measured (see Figure 4 in IHoekstra et al.ll2011al ). 



4.2 Deprojected aperture mass 

Rather than adopting a parameterized model that is fit to 
the data, a unique feature of weak lensing is that the ob- 
served shear can be related directly to a density contrast. In 
essence, it allows us to infer the projected mass within an 
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Table 2. Canadian Cluster Comparison Project sample 



(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


name 


r E 

[arcsec] 


cr 

[km/s] 


0.5 


7*2500 


2500 


T500 

[fey^kpc] 


500 


M NFW 
vir 


2500 


500 


Abcll 68 


17.5 + 2.6 


1098±5* 


4.0 + 0.6 


520±y 


2.6+°'® 
—0.5 


H78lg 7 


°- u -1.6 


lt3 -°-3.0 


„ , in 7 

2-7121 


7-511:? 


Abell 209 


15.9 ± 2.8 


4-7 

9371™ 


3.2 + 0.5 


4771?/; 


J-0 A 


12541^5 


4-1 r i 

6-9111 


4-9 A 

8-9111 


1-8121 


5.oii;5 


Abcll 267 


14.8 ± 2.8 


4_Sfi 

9901*1 


3.5 + 0.6 


4891^ 


2 1 + u - 4 


H44l^ 


4-1 7 

5-4111 


_4_9 S 

9-7111 


2-0121 


4-1 r i 

5-41H 


Abcll 370 


21.4 + 2.8 


13421*° 


5.8 + 0.8 


57013?, 


3 Q+°* 
°' y -0.7 


14531^ 


- . » „4_9 % 

1Z * y -2.4 


28-6121 


5-21H 


15.61*; 7 


Abcll 383 


11.0 + 3.4 


__ ,,_ _|_1 11 

Bellas 


2.0 + 0.6 


36llg 


u - ' -0.3 


1059±l|3 


.-4-10 

^■ A -1.6 


. ^4-9 n 
^•°-2.0 


„ i_o 4 

u - 9 -0.4 


^ ,4-11 


Abcll 963 


13.6 ± 2.6 


4_7» 

8931™ 


2.4 + 0.5 


42011 


4_n a 
1 - 6 -0A 


4- QA 

1020l? 4 2 


_4_1 A 

3-7111 


_ 4-9 1 

7-9111 


_i_n 4 
1 6+ 
i - D -0.5 


. .4-1 9 

4 4+ 1 


Abell 1689 


31.1 + 3.0 


12981^ 


6.3 + 0.6 


659±f 5 


A q+ - 6 
^' y -0.6 


15941*1 


13-91^1 


26-4111 


4 q+0.9 


- . - 4-9 K 

14-1111 


Abcll 1763 


19.2 ± 2.9 


4_77 

W75±H 


4.4 + 0.5 


4-97 

554l|5 


4_n ^ 
°- u -0.5 


4-1 OzL 

i4ZZ -104 


4-9 7 

IO.2IH 


13.6I3I 


7+O.6 
z -'-0.6 


4- 1 R 

' -°-1.8 


Abell 2218 


19.5 + 3.1 


10241™ 


3.6 + 0.6 


52413* 


Z -^-0.6 


1155l* 


-LI 4 

°- Z -1.3 


11.8111 


2 4+ -6 
Z - 4 -0.6 


4- 1 7 

6-5111 


Abcll 2219 


18.5 ± 2.7 


1067+ 70 

luo -75 


4.2 ± 0.6 


545+ 32 


9 q+0.6 
z - y -0.6 


1374+ 76 


9 3+ 2 '° 

y -°-1.9 


14 3+ 3 ' 4 


2 8+°' 7 
z -°-0.6 


70+1.9 

' -°-1.6 


Abcll 2390 


23.2 ± 2.6 


1 171+ 61 


4.3 + 0.5 


555 2g 


q 1+0.5 

—0 5 


13451^ 


8-7111 


20.71*1 


n+0.7 
— 7 


H-2l?1 


MS 0015.9+1609 


14.1 ± 2.7 


13561^ 


6.6 + 0.8 


5671^ 


4.7±g : g 


16261^° 


22.2+11 


26.8+*1 


4-8+11 


14.8121 


MS 0906.5+1110 


16.0 ± 2.8 


906+™ 


3.0 + 0.5 


477I32 


l-Slgl 


13801™ 


8.9l 2 i 


9.3+li 


1.9l°1 


5-1111 


MS 1224.7+2007 


8.6 + 2.8 


8171^ 


2.5 + 0.7 




1-2121 


8811/1 


2 7+ i:i 

— 1 


4 7+2-5 

— 2 1 


1-0121 


2-7111 


MS 1231.3+1542 


5.1 ± 2.6 


5491^1 


0.8 + 0.5 


295+ 4 4 ° 


0.5±g : 2 


547lf 25 


0-6121 


1-41J1 


0-3121 


0.8121 


MS 1358.4+6245 


12.8 ± 2.6 


9941* 


3.6 + 0.6 


47l!^ 


9 1+0.4 

— 5 


1140+*? 


5.9ll :9 7 


10.71*1 


2-1121 


6.0ll : 7 


MS 1455.0+2232 


14.6 ± 2.3 


945-W 


2.7 + 0.5 


423+29 


1-4121 


105ll 4 ° 


4.3111 


8.6111 


1-8121 


4.8+11 


MS 1512.4+3647 


7.7 + 2.7 


786112° 


1.8 + 0.7 


326 +4 ° 


0-7121 


859l2i5 


2.7+11 


4 3+20 

— 2 


0-9121 


2.5111 


MS 1621.5+2640 


15.0 ± 2.7 


1186lf 08 


4.2 + 0.7 


476+ 4 ° 


2.4i; : * 


1205l?1 


7-81^2 


16.8121 


q 9+1-0 
9 


9-4111 


CL0024.0+1652 


16.9 + 3.0 


12421^ 


5.2 + 0.8 


5451*2 


3-5l2: 7 


13231°'^ 


Q Q + 2 - 8 
— 2.8 


20.41^1 


3-8111 


11.2+11 


Abcll 115N 


11.9 + 2.8 


814l 8 9 8 8 
8191^ 


1.7 + 0.5 


4- ^ 

3321^ 


4-0 A 
(V 


1 041 + 


4-1 A 

3-91H 


4-1 7 

°- ' -1.9 


1 2+° 4 


q O+l-O 


Abell 115S 


12.0 ± 2.8 


2.4 + 0.5 


_ _ _ 4_«9 

355l 8 3 


„ ^4_n 7 
0-8121 


H57l 72 


_ , 4- 1 

5-41H 


„ „_|_9 1 

6-8l?1 


. 4-0 4 
1 - yi -0.4 


3.8111 


Abcll 222 


15.1 ± 2.9 


934l«2 


3.0 + 0.5 


45ll£ 


1-6121 


118ll 7 * 


5-81H 


7-9111 


1 6+°' 5 


4 4+1- 4 
^■^-1.2 


Abcll 223N 


16.7 + 2.9 


_ 4-78 

9761™ 


3.0 + 0.5 


J-A7 

441 + 4 ' 


1 -°-0.7 


1235l?g 


6.61H 


_4_9 S 

y * ' -2.4 


2 n+ 

Z - U -0.5 


c q+1-5 


Abell 223S 


11.3 + 3.0 


4- Of; 

805l^ 9 


2.5 + 0.5 


4_ E i7' 

3881S 


1 n+0.5 
1 - u -0.4 


4-1 07 

126ll^ 7 


4-9 1 

7.0+11 


_4_9 n 

5-7111 


_i_n 4 
1 2+ 


q 9+1-1 


Abell 520 


20.4 + 2.7 


10641^ 


3.4 + 0.5 


4951^ 


2 1 +°- 4 


H76l™ 


5.6111 


13.2111 


2 6+°' 5 
z -°-0.6 


7 2+ 1 - 4 
'■ z -1.5 


Abell 521 


11.8 ± 2.9 


_|_QQ 

31° 


3.0 + 0.5 


4-^7 

420lg 


1 4+0.0 


12021?* 


6 4+ 1, ' 


4-9 

8-6111 


4-H r i 

1-8121 


-±- 1 4 

4-8111 


Abell 586 


14.1 + 3.2 


868_ 103 


2.5 + 0.6 


4201^ 


1 q+0.5 

1 -°-0.4 


1 1 92+°l 
1±yz -100 

11461^ 


c 7+1-6 
°-'-1.5 


6-8111 


1-4121 


3-8111 


Abcll 611 


12.8 ± 2.9 


4-00 

yj ^-110 


3.3 + 0.7 


J-AA 

459l 44 


1-8121 


4-1 ^ 

5-SlH 


4-9 S 

8.211;* 


1 7+O.6 
-0.5 


4- 1 R 

4 6 + 


Abcll 697 


16.6 ± 2.7 


4_«1 

1059l|J, 


4.1 + 0.5 


4-^9 


2 6+ 
z - D -0.5 


137ll 4 ° 


4-1 ^ 

9-8111 


4-^ n 
11.4l|° 


q+0.6 
Z - r> -0.5 


4- 1 7 

6-4111 


Abcll 851 


18.1 + 3.0 


1 272+ 98 
iz,z -105 


5.1 + 0.5 


522l 2 * 


n 1 +0.4 
d -l-0.4 


1343lgg 


10.6111 


20.015; 4 


3-8lo1 


11.1+11 


Abell 959 


21.8 ± 2.9 


, i_77 

until 


4.7 + 0.6 


4-9Q 

55411 


4_n a 
°- Z -0.6 


4-79 

1276l 7 j 


_ _l_17 

7 0+ 1 ' ' 


18-6l 4 1 


3 6+°'* 

O.D_ 7 


4-9 9 

10.211' I 


Abell 1234 


17.3 ± 2.8 


9381^4 


2.3 + 0.5 


4-90 

423I3 9 


1 q+0.3 

1 -°-0.3 


980l°2 


4-1 9 

3 1 + 

°- 1 -i.o 


7-9111 


_i_n 4 
1 6+ 


4-1 h 

4.411-5 


Abell 1246 


14.4 + 2.6 


4-71 

880l 77 


2.5 + 0.4 


4- ha 

409I43 
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Figure 3. Plot of the ratio of the inferred lensing mass and the 
true mass as a function of centroid offset for an NFW halo with 
a virial mass M = 5 X 10 14 Mq. The lensing mass is inferred from 
the £ c statistic as explained in the text. The curves correspond to 
A/2500 (lowest curve; black), Miooo (red), M500 (cyan) and A/200 
(highest curve; blue). For overdensities A < 1000, the observed 
value for Ma is not very sensitive to centroiding errors because 
the large scale shear is not affected. The histogram shows the 
distribution of offsets between the BCG and the peak of the X- 
ray emission, suggesting that miscentering is a small effect for the 
CCCP sample. 



aperture, with relatively few assumptions about the actual 
mass distribution. We use that (Clowe et al. 1998): 



Cc(n) 



r r 2 o„2 
2 1 ji ■/-. \ 1 max 



dlnr(7t) + ■ 



dlnr( 7t ), (4) 



which can be expressed in terms of the mean dimensionless 
surface density interior to ri relative to the mean surface 
density in an annulus from r-z to r max 



Cc(n) = re(r' < n) - Ac(r 2 < r < r max ). 



(5) 



This shows that we can determine the average surface 
density within a given aperture up to a constant (i.e., the 
mean convergence in the annulus). The surface density in 
the annulus cannot be ignored, even though it is small for 
the wide field imaging data used here. We estimate the mean 
surface density in the annulus based on the best fit NFW 
model. For the clusters observed with the CFH12k camera, 
we adopt r2 = 600" and r max = 1000", and for the Megacam 
observations we use r2 = 900" and r max = 1500" (reflecting 
the larger field-of-view) . 

Thanks to our ability to measure the lensing signal out 
to large radii, the (model dependent) correction to the mass 
is only ~ 8% for M500 and ~ 3% for M25oo- The correc- 
tion is larger if we use the best fit SIS model instead of 
the best NFW fit. The difference is largest for M500, which 
would increase by ~ 10% compared to the results listed in 
Table [2l whereas M2500 would increase by ~ 4% if we use 



the SIS fit to estimate the convergence in the annulus. Note 
that lOkabe et al. I (|2010al ) assume that the contribution in 
the annulus can be ignored, which is definitely not the case, 
considering the bias it introduces for a large sample of clus- 
ters. Consequently we cannot compare our results directly 
to the aperure masses listed in lOkabe etail (|2010al ). 

Although the NFW model should be a good description 
of the average mass distribution on small scales, it is less ac- 
curate on large scales because structures near the cluster will 
contribute to the lensing si gnal. This can be stu died in the 
context of a halo-model (e.g. Johnston et al.l2007l ). However, 
for the radii and masses we study here thi s so-called two- 
halo contribution is expected to be small (jjohnston et al.l 
120071 ; iBecker fe Kravtsovll201ll ) . 

We examined the effect of errors in the adopted cluster 
center on the inferred aperture masses. The results are pre- 
sented in Figure [3] for four overdensities A. For higher over- 
densities one needs to integrate the shear signal to smaller 
radii and the mass becomes more sensitive to centroid off- 
sets. However, the observed offsets between the BCG posi- 
tion and the peak of the X-ray emission suggest that the bias 
in our masses are negligible (even for M25oo)- Note that for 
three of the four clusters with offsets larger than 200/i^ 1 kpc 
we use the X-ray center instead (see H3.ip 

To compare to results from other methods the aper- 
ture masses need to be deprojected, under the assumption 
of spherical symmetry. Non-parametric deprojections are 
noisy, and we th erefore employ a different approach, which 
was also used in lHoekstral (|2007l ). For each aperture we de- 
termine which NFW model would yield the observed pro- 
jected mass. The corresponding mass of the model is then 
taken to be the deprojected mass. Hence we assume that 
the density profile along the line-of-sight is described by the 
NFW model. Note that each aperture is treated indepen- 
dently: the corresponding virial mass (and consequently the 
concentration given by Eqn. [3]), is allowed to vary with ra- 
dius. The resulting profiles are used to measure r&. The 
results for A = 2500 and 500 are listed in Table [21 along 
wit h the corresponding v alues for ta- 

IBecker fc Kravtsovl (|201ll ) have shown that fitting an 
NFW model to the observed lensing signal can lead to biased 
mass estimates wh en the mass and concentration are free pa- 
rameters (also se e [Mencghetti et al. 2010; B ahe et ai1l2012l ; 
iHigh et al.ll2012l ). IBecker fc Kravtsovl (|201ll ) also make the 
point that the bias is likely to depend on the method that 
was used to infer the mass, as well as the range in angular 
scales that is considered. In particular, they show that ex- 
tending the NFW fits to large radii (> 10') tends to bias 
the masses low by ~ 6%. Most of the bias appears to be 
caused by the fact that the NFW fit overestimates the re- 
duced tangential shea r on scales of 10' — 20' before two-halo 
contributions (see e.g. Ijohnston et al.|[2007l ) become impor- 
tant. The NFW fit results presented in Table[2]are based on 
the lensing signal at radii 0.5 — 2ft,T 1 Mpc, for which the bias 
should be small. 

The si t uatio n for the aperture masses is less clear. 
IHigh et alj (|2012l ) used simulations to find that their ap- 
proach underestimates the masses at -R500 by 6 — 13%. How- 
ever, they use the ("-statistic proposed by iFahlman et al.l 
(|1994|) . which requires an estimate of k in an annulus ranging 
from -R500 out to 16', which is the largest scale considered by 
IHigh et all (|2012l ). Compared to our approach, the correc- 
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Figure 4. left panel: Value for M500 from the best fit NFW model to the lensing signal at radii 0.5 — 2/i^ 1 Mpc versus the M500 inferred 
from the deprojected aperture mass. The mass-concentration relation from Duffy et al. (2008) was used for the NFW fit. The dashed 
red line indicates the line of equality, right panel: The average ratio of the best fit NFW mass and the deprojected aperture mass when 
the normalization of the mass-concentration (Equation [3j is changed. High normalisations lead to inconsistent estimates for M500. 



Table 3. Senstivity to c(M) normalization 





aperture mass 


NFW fit 


A 


a = 0.8 


a = 1.2 


a = 0.8 


a = 1.2 


2500 


0.88 ±0.03 


1.09 ± 0.03 


0.90 ± 0.03 


1.07 ±0.03 


1000 


0.92 ±0.01 


1.06 ±0.01 


1.00 ±0.04 


0.99 ±0.03 


500 


0.94 ±0.01 


1.04 ±0.01 


1.06 ± 0.04 


0.95 ±0.03 


vir 






1.16 ± 0.05 


0.89 ±0.03 



Table of the ratio (MA(c«crj u gfy)/MA(cDufFy)) when the masses 
are estimated from deprojecting the aperture masses (Columns 2 
& 3) or by fitting an NFW model to the data (Columns 4 & 5). 
The errors correspond to the standard deviation of the ratios. 



tion for the convergence in this annulus is much larger, about 
35% at -R500, and thus more sensitive to deviations from the 
assumed density profile. We expect the bias to be propor- 
tional to this correction, and consequently we expect any 
bias in our d eprojected masses to be considerably smaller 
than those of Irligh et all (|2012l ). 



4.3 Sensitivity to mass-concentration relation 

Both procedures we use to derive cluster masses depend on 
the adopted mass-concentration relation (Equation [3} . It is 
important to note that c(M) depends on the cosmology: 
the concentration is related to the mea n density of the Uni- 
verse when the clus ter was formed (e.g. lNavarro et alii 19961 ; 
iBullock et aill200ll ). As more massive halos form later in 
a ACD M cosmology, they will have smaller concentrations 
(but see lPrada etai1l20lj ). Furthermore, the mass depen- 



dence is rather modest when considering a relatively small 
mass range, as is done here. 

The amplitude, however, is sensitive to the matter den- 
sity fi m and the normalization of the matter power spectrum 
as and various fitting functions have been published (e.g . 
Bullock et all l200ll ; iNeto et all 120071 ; iMaccio et ail 120081 ; 
Prada et al.ll2011f ) in addition to Equation The observed 
mass-concentration relation can therefore be use d in prin- 



Ettori et al 



Prada et al 



ciple to constrain cosmological parameters 
l2010t) . Note that the simulations studied by 
(2011) prefer a significantly higher normalization. This claim 
app ears to be supported by a number of observat ional results 
(e.g. lZitrin et al.ll20ld . [201ll ; IUmetsu et al.ll201ll ). It is, how- 
ever, not clear whether these findings are biased by the fact 
that many of these are well-known strong lensing clusters. 
For instance, accounting for the 3D st ructure of the clusters 
appe ars to yield lower concentrations dMorandi fc Limousin! 
12012? ). An important complication, which requires further 
study, is the role of baryons, which have not been considered 
in these studies. The effect depends on the feedback model, 
with steeper profiles if radiative cooling is efficient, and shal- 
lower profiles if f eedback is effective (e.g. iLewis et al. I l2000l ; 
iDuffv et al.ll2010l ). 

A detailed study of the mass-concentration relation is 
beyond the scope of this paper, but it is nonetheless use- 
ful to examine the sensitivity of our mass estimates to our 
choice for c(M ) by varying its normalization. We studied the 
average change in the deprojected aperture masses and the 
NFW fits and list the results in Table|3]for different overden- 
sities. The deprojected mass converges to the projected mass 
for large radii and hence it is not surprising that the aper- 
ture mass values for M500 are the least sensitive, with a 20% 
variation only changing the masses by ~ 5%. The NFW fits 
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show a similar variation for M500 but the dependence with 
c is opposite. Interestingly, the value for M^ot^ is nearly 
independent of the adopted normalization, suggesting that 
this provides the most robust mass estimate when fitting an 
NFW profile to the tangential shear profile between 0.5 and 
2/if 1 Mpc. 

If the density profiles are (on average) well described 
by the NFW profile we expect good agreement between our 
different approaches to measure the cluster masses. The left 
panel of Figure [4] shows the results when M5Q0 from the 
NFW fit is compared to the deprojected aperture mass es- 
timates Mgg . Note that the results are slightly correlated 
because the NFW fit uses data out to 2/i^ 1 Mpc, whereas 
the typical values for rsoo are ~ 1.2ft,^ Q 1 Mpc. Hence there 
is some overlap in the shear signals that were used to de- 
rive the masses. We fit a linear relation to the masses and 
find that the ratio of NFW to deprojected aperture mass is 
0.93 ± 0.04. 

The right panel of Figure [4] shows how this ratio varies 
with the normalization of the mass-concentration relation 
for both M2500 (black points) and M500 (red squares). We 
find that for A = 2500 the agreement is good for a wide 
range of normalizations. The comparison of M500 suggests 
a preference for a lower concentration, but a more detailed 
analysis is needed before a conclusion can be drawn. For in- 
stance, the presence of substructure biases the NFW masses 
low. How ever, significantly larger concentrations, such as 
found by IPrada et al.l (|201ll ) lead to inconsistent mass es- 
timates, and thus are at odds with our results. 



5 SZE SCALING RELATION 

On sufficiently large scales, clusters can be considered repre- 
sentative reservoirs of baryons and dark matter. We there- 
fore expect correlations between the observable properties 
and the underlying mass: more massive systems should have 
more of everything. These so-called scaling relations are the 
result of the physical processes that give rise to the forma- 
tion and evolution of galaxy clusters. If gravity is the dom- 
inant process, the scaling r elations and their evolution can 
be predicted (|Kaiserl ll986T ). These self-similar models pre- 
dict simple power law scaling relations between the baryonic 
tracers and the cluster mass. 

Additional non-gravitational processes, such as AGN 
feedback can in principle lead to significa nt deviations from 
the simple single pow er law model (e.g., iBabul et al.ll2002l ; 
iMcCarthv et al.ll2004f ). Gravity is, however, expected to be 
the dominant process for the masses probed by CCCP. Fur- 
thermore the mass range we study here is limited and we 
therefore assume that the scaling relation between the lens- 
ing mass Mwl and mass proxy Af proxy , in our case the 
Sunyaev-Zel'dovitch effect (SZE) signal, can be described 
by a single power law: 



Mwl = E(zyM 



M, 



pivot 



(6) 



where Mo is the normalisation, M p i vot is the pivot point, 
a the power law slope and E(z) = H(z)/Ho is the nor- 
malized Hubble parameter. If we fit Eqn.[S]to the measure- 
ments in §5.1 the resulting % 2 values are larger than ex- 
pected, suggesting the presence of intrinsic scatter (also see 



iMahdavi et all |2012| . for a comparison to the X-ray prop- 
erties). It is important to account for the intrinsic scatter 
when fitting a scaling relation to the data, because ignor- 
ing the scatter will generally bias the best fit parameters. 
The presence of intrinsic scatter is also relevant when as- 
sessing the performance of mass-proxies: an observable with 
larger statistical errors but smaller intrinsic scatter may be 
preferable over one with small statistical errors but a large 
intrinsic scatter. 

Some of this scatter arises from the fact that the lens- 
ing signal measures the projected mass along the line-of- 
sight, whereas for instance the X-ray luminosity probes the 
virialized regions of the cluster. Hence, the lensing mass 
is more sensitive to the fact that dark matter halos are 
triaxial. The separation of this 'geometric' source of scat- 
ter from the physical scatter requires a careful comparison 
with numerical simulations, incorporating in detail the steps 
taken in the analysis of the observations. The effects of 
triaxiality on weak lensing mass measureme nts have been 
studi e d using numerica l simu l ations (e.g., Metzler et al.l 
120011 : iMeneghetti et al.l [2OI0I: iBecker fc Kravtsovl 1201 j ) 
and analytically JCorless fc Kind 120071'). The stu dies by 
ICorless fc Kind (|2007f ) and lMeneghetti et all (|201Ch suggest 
a contribution of <tm ~ 0.15Mwl due to the triaxiality of 
the cluster dark matter halo. 



5.1 Fitting procedure 

We fit scaling relations to the actual measurements (not the 
logarithm), because they follow a (close to) normal distribu- 
tion, even if the relative error is large. The intrinsic scatter, 
however, is assumed to be log-normal, but relatively small. 
It can therefore be approximated by a normal distribution 
with a dispersion aq w ln(10)Qa"i O g q (we use the log with 
base 10). 

A number of techniques exist to fit a model to data with 
errors in both directions , but we follow a m aximum likeli- 
hood approach (also see IWeiner et alj (|200rJ) for a detailed 
discussion). For a model / with parameters a, the predicted 
values are yi = f(xi;a). The uncertainties in Xi and t/i are 
given by o x ^ and cr Vl i. If we assume a Gaussian intrinsic 
scatter oq in the y coordinate, the likelihood C is given by 



n 



■ exp 



[Vi ~ f{xj\a)f 
2w? 



where Wi accounts for the scatter: 



(*0 



+ cr v 



2 



(8) 



If we consider the logarithm of the likelihood it becomes 
clear why including the intrinsic scatter differs from stan- 
dard least squares minimization: 



21n£ = 2^1n^i + ^ 



+ C, 



(9) 



where the second term corresponds to the usual \ 2 an d C is 
a constant. If there is no intrinsic scatter the first term is a 
constant for a given data set and the likelihood is maximized 
by minimizing \ 2 - However, the first term cannot be ignored 
if intrinsic scatter is included as a free parameter. 
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Table 4. SZ measurements 



(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


name 


r sz 
[hZr, kpc] 


Y 


M proj (< r sz ) 


°500 


Ypsx 


MP ro j(< ef 00 ) 




[io- 10 ] 


[lO^/i-iMg] 


[ZijnMpc] 

L 7 (J J 


[io- 10 ] 


[lO 14 Vn lM 0] 


A68 


616 


1.01 ±0.16 


5.0 ±0.8 








A115N 








1.22 


4.27 ±0.58 


6.5±1.5 


A209 








1.14 


4.5 ±0.4 


8.9 ±1.5 


A267 


484 


0.72 ±0.10 


3.4 ±0.5 








A370 


508 


0.71 ±0.09 


5.9 ± 0.8 








A520 








1.24 


3.9 ±0.5 


8.2±1.7 


A586 


529 


1.03 ±0.14 


2.7 ±0.6 








A611 


482 


0.54 ±0.06 


3.2 ± 0.6 








A697 


568 


1.67 ±0.19 


4.8 ± 0.5 


1.28 


4.3 ±0.4 


12.7 ± 1.7 


A963 








1.11 


1.6 ±0.3 


5.5 ±1.5 


A1689 


664 


3.79 ±0.32 


8.1 ± 0.8 


1.38 


6.0 ±0.7 


16.3 ± 1.9 


A1758W 








1.26 


2.6 ±0.3 


12.8 ± 1.8 


A1763 








1.22 


3.8 ±0.4 


11.2 ± 1.7 


A1835 


672 


2.09 ±0.17 


6.5 ± 0.8 








A1914 


660 


3.01 ±0.25 


4.6 ± 0.7 


1.30 


4.8 ±0.5 


8.3 ±1.5 


A2111 


518 


0.95 ±0.21 


3.6 ± 0.5 








A2163 


682 


6.89 ±0.65 


6.8 ± 0.9 


1.52 


14.6 ±0.6 


15.0 ±2.4 


A2204 


671 


4.43 ±0.51 


6.2 ± 0.9 


1.44 


6.4 ±0.6 


12.0 ± 1.9 


A2218 


581 


1.94 ±0.19 


4.4 ±0.7 


1.12 


3.7 ±0.3 


7.1 ± 1.4 


A2219 








1.38 


7.2 ±0.4 


12.8 ± 2.0 


A2259 


476 


0.82 ±0.30 


1.8 ± 0.5 








A2261 


525 


1.34 ±0.16 


5.3 ± 0.5 


1.33 


4.1 ±0.4 


16.2 ± 1.6 


A2390 








1.39 


4.7 ±0.4 


12.3 ± 1.9 


MS0015.9+1609 


507 


0.73 ± 0.06 


6.7 ±0.8 








MS0451. 6-0305 


526 


0.66 ±0.05 


3.7 ±0.9 








MS0906.5+1110 








1.12 


2.3 ±0.4 


9.3 ±1.6 


MS1358.4+6245 


539 


0.56 ±0.08 


3.9 ± 0.6 








RX J1347.5-1145 


706 


1.61 ±0.18 


7.3 ± 1.2 








MACS J071 7. 5+3745 








1.36 


2.3 ±0.3 


23.5 ± 3.7 


CIZA J1938+54 








1.19 


2.6 ±0.2 


10.8 ± 1.8 



Column 2: r2500 a s determined bvlBonamente et al.l 1 120081) from a joint analysis of X-ray and SZ data; Column 3: integrated Compton 
parameter fromlBonamente et alj (I200SI) ; Column 4: the projected weak lensing mass within fffoQi Colum n 5: the value for Q500 from 
iPlanck Collaboration et alj (l201lh ; Column 6: the integrated Compton Y— parameter within 5©^q from iPlanck Collaboration et alj 
1120111 ); Column 7: projected weak lensing mass within 0^qq- 



5.2 SZE measurements 

The inverse Compton scattering of Cosmic Microwave Back- 
ground (CMB) photons by the hot electrons in the intra- 
cluster medium results in a small distortion of the CMB 
spectrum, known as the S unyaev-Zel'dovich effect (SZE; 
ISunvaev fc Zerdovichlll970h . Depending on the observed fre- 
quency this leads to a reduction or increase in the bright- 
ness of the CMB at the location of the cluster (see e.g. 
lBirkinshawlll999h . The surface brightness of the effect does 
not depend on redshift. As a consequence, it is (almost) as 
easy to detect high redshift clusters as it is to find them at 
lower redshifts. It is for this reason that the SZE has devel- 
oped into an important technique to search for clusters of 
gala xies with the aim to co nstrain cosmological parameters 
(e.g. ICarlstrom et al.ll2002h . A num ber of dedicated surveys 
have started to relea se results fe.g.. Iwilliamson et al.ll201ll ; 
iMarriage et al.ll201lh . 

An additional benefit of SZ observations is the fact 
that the amplitude of the SZE, quantified by the integrated 
Compton y— parameter Y, is expected to be a good measure 
of the cluster mass. It is, however, still sensitive to the clus- 



ter physics (e.g., iMcCarthv et al.ll2003al lbh and thus needs 
to be calibrated observation ally, ideally using weak gravi- 
tational lensing. For instance iHigh et al.l l|2012h present the 
first weak lensing masses from dedicated follow-up obser- 
vation s for clusters discovered by the South Pole Telescope 
fSPT; ICarlstrom et al.l (|201lr i). 

The first com parison of the SZE sig nal to lensing masses 
was presented in iMarrone et al. (2009) who found a fairly 
large scatter using a sample of 14 massive clusters. The lens- 
ing masses, however, were based on HST observations cov- 
ering only the cluster cores. This complicate s the mass esti- 
mate s and might lead to increased scatter (Hockstr a et al.l 
I2002T ). More recently. IMarrone et ail i|201lf) compared the 
SZE sign al for 18 clusters t o weak lensing masses deter- 
mined bv lOkabe et~ai1 (|2010aT ) using Subaru wide field imag- 
ing data. The resulting scaling relation is in good agreement 
with earlier studies t hat assumed hydrostatic equilibrium. 
IMarrone et~ai1 |201ll 'l find that a scatter of ~ 20% in weak 
lensing mass at fixed Fsz, although they note the scatter 
depends whether clusters are classified as disturbed or not. 

In this section we compare our weak lensing masses 
to available measurements of the SZE signal from the lit- 
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Figure 5. ie/t panel: Plot of the projected weak lensing mass as a function of SZ signal. The mass is measured within an aperture of 
radius ffsoo as determined by Bonamente et al. (2008) based on a joint analysis of X-ray and SZ data. The solid red line indicates the 
best fit power law model when the merging cluster A2163 (indicated) is excluded, whereas the dashed line is for the full sample. Right 
panel: Weak lensing mass within ©^g as a function of projected Y from Planck Collaboration et al. (2011). 



erature. We cons i der t wo samples, which partly overlap. 
iBonamente et all (|2008l ) present results for 38 massive clus- 
ters with BIMA and OVRO, 19 of which overlap with our 
sample. The second compariso n is with the overlap of 18 
cluste rs that were released in iPlanck Collaboration et al.l 
(2011) as part of the Planck early results. Our aim is not 
to calibrate the scaling relations but to examine the poten- 
tial of the SZE signal as a mass-proxy. The derivation of a 
useful scaling relation would require the apertures for the 
SZE results to be matched to the lensing radii, a s is done 
for the X-ray observation in iMahdavi et al.l (|2012l ) . 

IBonamente et al] (|2008l ) list their estimates for rff o> 
based on a joint analysis of the available SZ and X-ray data. 
Their results are reproduced in Table [4] As the SZ mea- 
surement is a projected quantity (although not as extreme 
as the lensing signal), we compare to the projected lensing 
masses within an aperture rffoo > which are also listed in Ta- 
ble |4] Assuming a constant gas fraction and self-similarity, 
the SZ signal Y scales with mass as M 5/3 oc D 2 A E{ z)~ 2,3 Y 
(e.g. iMcCarthv et alj|2003al : IBonamente et al . 2008). We as- 
sume this holds and take D 2 A E(z)~ 2 ^Y as the proxy for 
mass. The left panel of Figure [5] shows the projected lens- 
ing mass within r ffoo a s a function of the SZ signal from 



IBonamente et al.l ((2008). 



We fit a power law model to the measurements, fol- 
lowing the procedure described in §5.1. For a pivot value 
D 2 A E(z)~ 2ri Y = 10~ 4 we find a best fit normalization of 



andaslopea = 0.48io;p7, 



M = (5.16±0.24) x lO 14 ^ 1 M G 
somewhat shallower than a = 0.6 expected from the self- 
similar model. We obtain an intrinsic scatter of 12 ± 5% in 
mass for fixed Y. Note that ou r scatter is low e r, bu t con- 
sistent with the values found by iMarrone et al.l (|201ll ) . The 
merging cluster A2163 is the main outlier and if we remove 



Table 5. Best fit parameters of the SZE scaling relations 
Bonamente all without A2163 



CT iog(y|M) 

M 

a 


o io+°- 05 

u - lu -0.04 
5.31 ± 0.24 


06+0' 05 
5.51 ±0.25 

n r 7 +0.09 
u -°'-0.08 


a log(M\Y) 

M 
a 


0.05 ± 0.02 
5.16 ±0.24 
0.46 ± 0.07 


04+0- 02 
5.43 ± 0.25 
^+°- 09 

u - oo -0.08 


Planck 


all 


without A2163 


CT log(y|M) 
Mo 

a 


17+0-09 

u - x '-0.06 

11.0 ±0.7 
47+0- 11 

u -^' -0.09 


12+0' 07 

U - -0.05 

11.2 ±0.7 
60+° 14 


a \og(M\Y) 

Mo 

a 


076+ - 025 
10.4 ±0.6 
0.45 ± 0.11 


10.6 ±0.6 
0.56 ±0.16 



For the com p arison with the measurements from 
IBonamente et al] d2008f) . Mo is the normalization of the 
best fit power law with slope a to the projected weak lensing 



aperture mass within r^oo m un it s of 10 14 /i 7 q 1 Mq, for a pivot 
D'j^E(z)~ 2 / i Y = 10~ 4 . We present parameters for a log-normal 
intrinsci scatter in Y for fixed mass, a\ g(Y\M) an d v i ce versa, 
where we use the logarithm with base 10. For the com p arison 
with the Planck result from IPla nck Collabora tion et alj d201 lh 
we use a pivot of 2 X 10 — 4 . For both samples we present results 
for the full sample and when Abell 2163 is excluded. 



this cluster the results are consistent with no intrinsic scat 
ter and the slope steepens to a = 0.57lg'og 



with the self-similar prediction a nd the slope of 0.60 +0 ' 08 
found by IBonamente et al.l l|2008l ) for their full sample 



m agreement 

0.08 
0.06 
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IPlanck" Collaboration et al.l (|201ll ) provide the inte- 
grated y-parameter within an aperture of radius 505ooj 
where Qsnn is deter mined from X-ray observations by 
iPiffaretti et all (|201lh . Table U lists the Planck measure- 
ments and our projected lensing masses within 6500- The 
right panel of Figure [S] shows the weak lensing mass within 
this radius as a function for the SZE signal measured by 
IPlanck Collaboration et al.l |201ll 'l. For these data we choose 
a pivot value D 2 A E{z)~ 2/3 Y = 2 x 10" 4 , which gives a best 
fit normalization of M = (10.4 ± 0.6) x 10 14 /if 1 M Q and 
a slope a — 0.45 ± 0.11. The intrinsic scatter is 19lg% 
in mass for fixe d Y, consistent w ith the result f or th e 
iBonamente et a l. (2008) sample and lMarrone et all |201l| y 
As before, omitting A2163 from the analysis steepens the 
slope to a = 0.56 ± 0.16. 



6 CONCLUSIONS 

The Canadian Cluster Comparison Project targeted 50 mas- 
sive X-ray luminous clusters of galaxies with redshifts 0.15 < 
2 < 0.55, with the aim of studying the scaling relations be- 
tween cluster mass and baryonic tracers and to probe the 
cluster-to-cluster variation in the thermal properties of the 
hot intracluster medium. In this paper we present the re- 
sults of our weak lensing analysis of deep wide-field imaging 
data obtained using the CFHT. 

To ensure a significant detection of the lensing signal, 
clusters with an ASCA temperature of Tx >5keV from 
iHorneri <|200ll ) were the main targets of the CCCP. This 
led to an overrepresentation of z ~ 0.2 clusters. Although 
the sample lacks a well-defined selection function, it ap- 
pears to be represent ative, based on a co mparison of the 
the Lx-Tx relation |Mahdavi et alj|2012ft . We include two 
additional clusters that were located in the observed field- 
of-view. Hence we determine weak lensing masses for a to- 
tal sample of 52 clusters of galaxies. We upda t e the masses 



Hoekstral (120071), using 



Duffy et all (|2008l ) and 



for 20 clusters studied previously in 
the mass-concentration relation from ' 
present new results for 32 clusters observed with MegaCam. 

We measure the lensing signal out to large radii, which 
allows us to determine aperture masses, which are nearly 
model-independent. To allow comparison with other observ- 
ables we deproject the masses. We also fit NFW models to 
the data and explore the sensitiviy of our results to the 
adopted mass-concentration relation. The values of M500 
based on the aperture masses are robust, with a 20% increase 
(decrease) in the normalization of c(M) resulting in a 4% in- 
crease (6% decrease) in the mass. The aperture masses agree 
well with the results from fitting NFW models. Although we 
cannot rule out a lower normalization, a significant increase 
in the concentration at a given mass leads to inconsistent 
values for M500 when comparing the masses from the NFW 
fit to the aperture masses. 

The aperture masses are the reference for the compar- 
ison of cluster properties at other wavelenghts. The scal- 
ing relation between a r ange of X-ray propert ies and lens- 
ing mass is presented in Ma hdavi et al.l (|2012r ). In this pa- 
per we limit the comparison to published measurements of 
the Sunyaev-Zel'dovich (SZE) effect. We study a sample of 
19 clusters that overlap with the study of lBonamente et alj 



(2008) and another sample of 18 cluster that were observed 
by Planck (|Planck Collaboration et alJlioTlh . 

The SZE signal correlated well with the projected 
lensing mass. For both samples we find a best fit slope 
of the power law scaling relation that is lower than the 
value of a — 0.6 for self-similar models. However, when 
the merging cluster A2163 is excluded, in both cases the 
slopes are in agreement with the self-similar prediction. 
We find an intrinsic scatter of 12 ± 5% in projected mass 
(-M2500) for fixed Y f or the clusters that overlap with 
IBonamente et al. J2008I). The comparison with the Planck 
results |Planck Collaboration et alj|201ll 'l yields an intrinsic 
scatter in projected mass (M500) of I9±l% at fixed Y. 

The scat t er ag rees well with the results from 
iMarrone et al.l |201ll ) and is comparable to the scatter 
in the scaling relations with (more expensive) X-ray ob- 
servables such as hy d rostatic mass and te mperature (e.g., 
iMahdavi et~ai1 |2012| ; lOkabe et all l2010bl ). This demon- 
strates that the SZE signal is a competitive proxy for cluster 
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